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SUMMARY 

paper  contains  a  general  survey  of  the  mathematical 
theory  of  zero-~*um  two-person  games .(  >I^--waa~|M%para£L. for 
publication  in  the  Applied  Mechanics  Renews. 
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THEORY  OF  GAMES  OF  STRATEGY 


In  many  economic,  military,  and  operations  research 
problems,  the  game  factor  dominates — i.e.,  the  outcome  or 
result  can  only  be  described  in  terns  of  the  decisions  made 
by  several  participants,  each  having  a  different  objective. 

The  Theory  of  Games,  a  relatively  new  branch  of  mathematics, 
analyzes  such  strategic  problems  by  studying  the  following 
theoretical  model  patterned  on  actual  parlor  games  such  as 
Chess  and  Poker:  *  participant  knows  that  one  of  several 
possible  events  will  occur,  and  with  respect  to  these  events 
he  has  certain  preferences.  He  lacks  full  control  over  the 
variables  which  determine  the  event.  Although  he  has  some 
control,  other  participants,  who  have  different  objectives, 
also  influence  the  event.  Further,  the  event  nay  be  influenced 
by  random  elements. 

Games  of  chance  have  been  studied  mathematically  ror  many 
years,  and  the  mathematical  theory  of  probability  was  developed 
from  their  study.,  Although  strategic  situations  have  long  beer 
observed  and  recorded,  the  first  attempt  to  abstract  them  into 
a  mathematical  theory  of  strategy  was  made  In  1921  by  Emile 
Borel.  The  theory  was  firmly  established  ny  John  von  Neumann 
ir  1928  when  he  proved  the  Minimax  Theorem ,  However,  it  was 


the  publication  in  of  the  impressive  work 


*ory  of  Games 


and  Economic  Behavior,  by  John  von  Neumann  <And  Oskar  Me^genatero , 
that  stimulated  research  m  the  mathematical  theory  of  games. 
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By  &  game  Is  meant  a  set  of  rules  which  specify 
unambiguously  the  number  of  players,  the  moves  that  each 
player  may  or  must  make  under  all  possible  circumstances, 
the  moves  that  are  made  for  the  players  by  chance,  the  amount 
of  information  available  to  the  players,  and  the  payoff  to 
each  player.  Von  Neumann  has  given  a  mathematically  precise 
definition  of  a  game  by  making  use  of  the  notion  of  partition 
of  sets. 

Although  each  game  is  initially  described  in  terras  of 
its  moves  and  the  amount  of  information  available  to  the 
players,  we  can  normalize  the  game  by  the  introduction  of 
the  concept  of  a  strategy.  In  the  actual  play  of  the  game, 
each  player,  instead  of  making  his  decision  at  each  move 
of  the  game,  may  formulate  a  complete  plan  for  playing  the 
game  from  beginning  to  end,  for  every  situation  that  may 
arise.  Such  a  plan  is  referred  to  as  a  strategy.  It  takes 
into  account  any  information  that  may  become  available  in  the 
course  of  the  play  of  the  game.  No  freedom  of  action  is 
lost  through  the  use  of  a  strategy,  since  a  strategy  specifies 
a  player's  actions  in  terms  of  the  Information  that  may  be 
available  in  accordance  with  the  rules  of  the  game. 

Kvery  game  can  be  described  in  terras  of  the  sets  of 
strategies  of  the  players,  where  each  strategy  is  a  player's 
method  of  playing  a  given  game  from  beginning  to  end.  Every 
combination  of  strategies,  one  from  each  player,  will  determine 
an  outcome  of  the  game  which  is  measured  by  a  payoff  to  each 
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player.  Thus  the  game  is  determined  by  the  number  of  players > 
their  strategies,  and  their  payoff  functions.  If  a  player  has 
n  different  strategies,  they  may  be  identified  by  the  numbers 
1,  2,  . . . ,  n. 

The  fundamental  problem  of  the  theory  of  games  of  strategy 
is  the  determination  of  optimal  strategies  for  each  player  and 
to  evaluate  the  amount  each  player  can  expect  to  receive.  No 
satisfactory  theory  exists  as  yet  which  solves  the  problem  for 
an  arbitrary  number  of  players.  In  general,  the  difficulties 
are  both  computational  and  conceptual.  However,  the  fundamental 
case  of  two  players  with  opposing  interests  is  conceptually 
complete  and  much  progress  has  been  made  on  the  technical  problems. 

1.  Oames  with  Finite  Number  of  Strategies 

The  mathematical  model  for  games  with  two  players  having, 
atrlotly  opposing  interests  itf  deceptively  simple i  Player  I 
chooses  a  strategy,  any  one  of  the  numbers  i  •  1,  2,  ...,  m, 
and  Player  II  chooses  a  strategy,  any  one  of  the  numbers 
j  •  1,  2,  ...,  n,  each  choice  being  made  without  any  knowledge 
of  the  other.  The  payoff  to  Player  I  is  a  function  of  the 
choeen  strategiee,  a^j,  while  the  payoff  to  Player  II  is  -a^. 

The  objective  of  Player  I  is  to  maximise  a^,  but  he  controls 
only  the  oholce  of  1,  whi'.e  the  objective  of  hie  opponent  Is 
to  maximise  -a^  and  he  oontrols  the  oholce  of  j.  What  are  the 
guiding  principles  which  should  determine  the  choices  and  what 
is  the  expected  outcome  of  the  game?  Ve  have  assumed  that  the 
outcome  can  be  measured  quantitatively  by  a  number  and  that 
the  notion  of  expectation  is  applicable  for  this  measurement. 
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Among  Player  I‘s  strategies  there  exists  some  strategy 
such  that  he  can  obtain  a  payoff  of  at'  least  max  p»in  a.  . 

4  4  ij 


Player  II  has  some  strategy  such  that  he  pays  no  more  than 
li 

J 


min  max  a...  For  every  matrix  (at .  . ) #  we  have 
^  i  j  i  j 


max  min  a. -  <  min  max  a.  . . 
1  J  1J  “  J  1  1J 


If  the  game  is  such  that 


max  min  a. .  ■  min  max  a,  .  «  v, 
i  J  1J  J  i 


then  Player  I  has  a  strategy  which  yields  him  at  least  v,  and 
he  can  be  kept  from  getting  more  than  v  by  the  second  player. 
Thus  in  this  case,  there  are  optimal  strategies  i*  for  the  first 
playsr  and  J*  for  the  second  player  which  have  the  following 
properties:  (l)  if  Player  I  chooses  i#,  then,  no  matter  what 
Player  II  does,  Player  I  will  get  at  least  v;  (2)  if  Player  II 
chooses  J#,  then,  no  matter  what  Player  I  does.  Player  I  will 
get  at  most  v;  and  (3)  if  Player  I  were  to  announce  in  advance 
of  the  play  that  he  plans  to  use  strategy  i#.  Player  II  could 
not  take  advantage  of  this  information  and  thereby  reduce 
Player  I's  payoff. 

A  necessary  and  sufficient  condition  that  max  min  a. .  • 

i  J  1J 

min  max  a, .  is  that  the  payoff  matrix  (a.  .)  have  a  saddle-point; 

j  i  U  ij 

i.e.,  that  there  exists  an  element  of  the  matrix  which  Is  at  the 
same  time  the  minimum  of  its  row  and  the  maximum  of  its  column. 


If  the  payoff  matrix  (a^j)  la  such  that  it  contains  no 
saddle— point,  and  hence  neither  player  has  an  optimal  strategy, 
it  la  necessary  to  generalise  the  notion  of  a  strategy.  A 
player,  instead  of  ohooalng  a  single  strategy,  chooses  a 
probability  distribution  over  his  set  of  strategies  and  the 
particular  strategy  for  the  play  of  the  game  ir  chosen  by  some 
chanoe  device  satisfying  this  probability  distribution.  Each 
probability  distribution  over  the  set  of  strategies  is  a  mixed 
strategy.  This  randomisation  by  a  player  protects  him  against 
choosing  a  strategy  which  would  benefit  his  opponent.  Of 
course,  any  strategy  can  be  regarded  as  a  mixed  strategy. 

However,  to  evaluate  the  effect  of  a  mixed  strategy,  the  expected 
value  of  the  effect  of  the  strategies  needs  to  be  determined. 

Let  Sm  and  3n  be  the  sets  of  mixed  strategies  of  Player  I 
and  Player  II,  respectively.  Let  B(X,  Y)  be  the  expected  payoff 
received  by  Player  I  if  he  uses  mixed  strategy  X  and  Player  II 
uses  mixed  strategy  Yj  then  It  turns  out  that  in  any  game  with 
a  finite  number  of  strategies 

Max  Min  B(X,  Y)  -  Min  Max  B(X,  Y)  -  v. 

Y«n  Y€Sn  X«m 

This  is  the  Main  Theorem  of  finite  games,  and  was  first  proved 
by  von  Neumann  in  1928.  It  shows  that  both  players  have  optimal 
mixed  strategies,  i.e.,  there  exists  a  probability  distribution 
for  Player  I  which  assures  him  a  gain  of  at  least  v  on  the 
average,  and  there  exists  a  probability  distribution  for  Player  II 
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which  insures  him  against  a  loss  of  more  than  v  on  the 
average. 


2.  Solution  of  Plnlte  Games 

The  sets  of  optimal  strategies  can  be  characterized 
geometrically  as  closed  convex  polyhedra.  Thus  to  find  all 
optimal  strategies  we  need  only  determine  the  extreme  points 
of  the  convex  set.  L.  S.  Shapley  and  R.  N.  Snow  have  shown 
that  each  extreme  point  is  associated  with  some  nonslngul&r 
square  submatrix  of  the  payoff  matrix  (a^j).  Thus  every  optimal 
strategy  which  is  an  extreme  point  can  be  obtained  as  the 
unique  solution  of  a  suitably  chosen  subsystem  of  linear 
equations. 

There  exist  iterative  procedures  for  approximating 
optimal  mixed  strategies.  The  two  most  common  procedures  are 
due  to  Oeorge  W.  Brown  and  John  von  Neumann.  The  Brown  pro¬ 
cedure  is  a  method  of  fictitious  play  which  bases  future 
decisions  on  relevant  past  history.  In  the  von  Neumann  pro¬ 
cedure  the  steady  state  solution  of  systems  of  differential 
equations  corresponds  to  the  optimal  strategies  of  the  game. 

3.  Games  with  Infinite  Strategies 

Although  most  parlor  games  require  the  evaluation  of  a 
finite  number  of  strategies,  many  military  and  economic  games 
have  an  Infinite  number  of  strategies.  For  example,  in  such 
games  It  is  frequently  required  to  choose  an  optimal  percentage 


P-99* 

12-17-^6 

from  an  Infinite  number  of  percentage*.  In  an  infinite  game, 
the  two  players  choose  strategies  x  and  y  from  infinite  sets 
S1  and  S2.  The  payoff  is  the  value  of  a  function  M(x,  y). 

By  analogy  with  the  finite  games,  if 

Max  Min  M(x,  y)  <  Min  Max  M(x,  y) 
xcS^  yfSg  y«Sg  x«S1 

then  mixed  strategies  are  required They  are  defined  as  proba¬ 
bility  distributions  F  and  0  over  and  8g.  Now  Player  I's 
expectation  Is  given  by  the  double  integral 

*(F.  0)  -//*(*.  y)d»(x)dO(y). 

The  existence  of  optimal  mixed  strategies,  l.e. 

Max  Min  K(P,  0)  -  Min  Max  E(F,  a) 

F  0  OF 

now  depends  on  the  function  M(x,  y).  If  M(x,  y)  is  continuous, 
with  3l  and  Sg  unit  intervals  0  £  x  £  1,  0  £  y  £  1,  then  optimal 
mixed  strategies  exist.  However,  if  M(x,  y)  is  discontinuous, 
optimal  strategies  need  not  exist.  Even  when  optimal  strategies 
exist,  as  in  the  oase  of  continuous  M(x,  y),  no  general  method 
exists  for  computing  them. 

The  method  of  solution  of  infinite  games  depends  upon  the 
functional  form  of  N(x,  y).  In  many  applications  the  payoff 
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function  M(x,  y)  Is  strictly  convex  in  y  for  each  x,  In  which 

case  an  optimal  strategy  for  Player  II  is  to  choose  that  y 

which  minimizes  Max  M(x,  y).  However,  Player  I  must  use  a 

x 

mixed  strategy  generally  consisting  of  two  strategies.  The 

value  of  the  game  is  Min  Max  M(x,  y). 

y  x 

If  the  payoff  function  is  a  polynomial,  i.e..  If 

m,n  ,  . 

*(x,  y)  -  J  aijx  y  » 
i,J-0 

then  both  players  have  optimal  mixed  strategies  which  are  step 
functions.  Each  player  randomizes  on  1/2  min  (m  +  2,  n  +  2) 
strategies,  at  most. 

4.  Examples  of  Infinite  dames 

ALLOCATION  OP  RESOURCES.  The  xollowing  attack-defense 
game  has  interpretations  in  economics  as  well  as  in  military 
planning.  Given  n  targets  T^,  T^,  ...,  Tr  whose  values  are 
vl#  v2,  ...»  v  ,  respectively.  Suppose  Player  I  has  a  total 
of  A  attacking  units  and  Player  II  has  a  total  of  D  defensive 
units.  How  should  che  players  allocate  their  forces  among  the 
n  targets?  Let  us  assume  that  if  x  -  (x^,  x2*  . ..,  *0)*  where 
each  is  nonnegative  and  £  x^  •  A,  is  am  allocation  of  Player 
I's  resources  among  the  n  targets,  and  if  y  *  (y^,  y^,  ...,  yn) 
ia  an  allocation  of  the  defender's  forces  among  the  targets, 
then  the  payoff  to  Player  I  Is  defined  by 
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n 

${x,  y)  -  £  Vj  max  (0,  -  yi). 

1*1 

This  is  an  n-'i  dimensional  infinite  ga ire  with  a  contlnuoub 
payoff  and  hence  optimal  strategies  exist.  If  A  £  D,  then  the 
optimal  strategy  for  the  defender  is  to  defend  only  the  high— 
valued  t&rr*  s  and  le«ve  low— valued  targets  undefended.  However, 
the  attacker  must  us®  a  mixed  strategy.  He  selects  one  of  the 
targets  at  random.  subject  to  a  given  probability  distribution, 
and  Allocates  his  entire  force  A  to  that  target.  An  interesting 
property  of  the  optimal  strategies  ia  that  the  high-valued  targets 
which  are  defended  are  also  the  targets  which  may,  depending  on 
the  outcome  of  the  randomization,  receive  the  concentrated  attack. 
The  low— valued  targets  are  undefended  and  never  attacked.  Further, 
tne  optlr'l  strategies  x*  and  y*  are  such  that  there  exist  nu 
soft  t  jo te ■  i.e., 

v, ( A  -  y^)  *  v  «  constant,  if  >  C 

v^A  <  V,  yi  -  0. 

71*13*3  OF  DECISIONS.  Many  timing  problems  can  be  viewed 
«a  two— player  games  whose  rules  describe  the  actions  which  the 
players  are  to  take,  but  the  timing  of  the  actions  to  be 
determined  by  the  players.  Ir  these  games  each  player  wishes  to 
>*?.ay  the  actions  as  long  as  possible  but  he  is  also  penalized 
for  delaying  by  his  opponent.  This  conflict  of  interests  can 
ba  resolved  and  a  best  timing  of  actions  can  be  determined  for 
**ch  player. 
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Let  ue  assume  that  each  player  has  to  choose  only  one 
action  time.  Let  us  also  assume  that  each  player  ie  informed 
of  his  opponent's  action  as  soon  it  takes  place.  Define 
P1 ( x }  as  the  probability  that  Player  1  will  succeed  if  he  acts 
i\t  time  x,  and  > }  as  the  probability  that  Player  II  will 
succeed  If  he  acte  at  time  y.  Let  the  payoff  be  vl  to  the 
successful  player  and  0  to  both  players  if  both  or  neither 
succeeds,  then  the  expected  payoff  to  Player  I  as  a  function 
of  action  times  x  and  y  becomes: 

j'2Pi(x)  -  1  if  x  <  y 

M(x,  y)  ~^P1(x)  -  P2(x)  if  x  -  y 

(l  -  ?P2( y)  it  x  >  y. 

In  this  case  the  optimal  action  time  for  each  player  depend® 
o;  the  solution  of  the  equation 

F1(t)  *  Pg(t)  -  1. 

5a  h  player  delays  his  action  until  t,  if  his  opponent  has 

not  acted  prior  to  t.  If  his  opponent  has  acted  prior  to  t, 

then  the  player  acta  at  time  x  such  that  P^x)  *  1>  l.e.,  the 
player  waits  until  he  is  crrtair.  of  success. 

if  in  the  preceding  e^enrple  wo  remove  the  information 
aspect  and  assume  that  each  player  is  ignorant  of  the  action, 
if  any,  taken  by  his  opponent,  then  optimal  timing  requires 
ran domination  by  both  players.  Let  us  irtirkc  the  additional 
assumption  tn&t  P^(xj  »  ( x } }  then  the  payoff  function  becomes 
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y  4-  (1  ♦  y)x 

M(x,  y)  »  ^  0 

-y  ♦  (l  -  y)* 


If  x  <  y 
if  x  -  y 
if  x  >  y 


where  x  and  y  are  the  probabilities  of  success  or  Player  I 
and  Player  II,  respectively.  The  two  players  have  the  same 
optimal  strategies— -del  *y  until  the  probability  of  success  is 
at  least  1/3,  then  act  at  a  time  chosen  at  random  subject  to 
the  probability  distribution 


F{x)  -  0 


o  £,  x  £  1/} 


1/5  £  x  <.  1  • 
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